Avoid memory leak in unit test driver #4249

roystgnr · 2025-09-10T13:59:13Z

If we add only a subset of tests to the runner (via --re or --deny_re options), we're careful to move the other tests to a "rejects" suite so they'll get deleted there, but the "supertest", the suite holding all the other tests, wasn't getting deleted in those cases.

This (on top of my earlier fixes, and using the MOOSE suppressions file for third-party library issues) gets selective valgrind runs of our unit tests clean for me. (all-tests runs are clean either way)

This isn't urgent to merge; I still need to run through our examples to look for valgrind issues too. This tiny leak is only a problem because it upsets valgrind, and until we're sure we're ready to make that be a big deal (add a --error-exitcode= option to our valgrind recipes), "do our unit tests upset valgrind" isn't a critical question.

roystgnr · 2025-09-10T14:03:07Z

Scratch "gets selective valgrind runs of our unit tests clean for me" - I'm still seeing something else, and I'm not sure if it's just something I missed before or a regression from this PR.

jwpeterson

I had some questions that could probably be cleared up if I read the cppunit docs, but I chose not to 🤷‍♂️

jwpeterson · 2025-09-10T14:18:41Z

tests/driver.C

                                 runner, rejects);
  if (n_tests_added >= 0)
    libMesh::out << "--- Running " << n_tests_added << " tests in total." << std::endl;
+  if (n_tests_added != -12345)


Minor comment, but since this is our own magic number that we now actually have to refer to, it might make sense to use libMesh::invalid_uint or some other named constant.

jwpeterson · 2025-09-10T14:22:31Z

tests/driver.C

  if (n_tests_added >= 0)
    libMesh::out << "--- Running " << n_tests_added << " tests in total." << std::endl;
+  if (n_tests_added != -12345)
+    owned_suite.reset(suite);


I guess I don't understand why we don't need to clean up suite when there's no tests added? From a surface level reading of the code it just looks like registry.makeTest() returns a dumb pointer whose lifetime we are expected to manage, and we were always leaking it before...

This should clean up suite when there's no tests added. That'll return n_tests_added == 0, and 0 != -12345, so we put suite in our unique_ptr and it gets cleaned up.

moosebuild · 2025-09-10T18:26:01Z

Job Coverage, step Generate coverage on 76de8d1 wanted to post the following:

Coverage

	c144a6	#4249 76de8d
	Total	Total	+/-	New
Rate	65.26%	65.26%	-0.01%	-
Hits	77376	77370	-6	0
Misses	41184	41190	+6	0

Diff coverage report

Full coverage report

This comment will be updated on new commits.

roystgnr · 2025-09-10T20:06:45Z

There's definitely something weird going on here. This patch cleans up the leaked suite from TestFactoryRegistry::makeTest(), but it starts complaining about a test suite destructor hitting invalid accesses to already-freed memory from TestSuiteFactory<AllSecondOrderTest>::makeTest() in particular.

If every test class was complaining then I'd be sure I'm trying to do a double-free here, once from the runner and then once from recursive destruction from the suite, but it's just AllSecondOrderTest complaining?

No, wait, it's too much of a coincidence that the failing AllSecondOrderTest is first alphabetically. I must be trying to do a double-free here, but something about the first UB causes the destructor to skip the rest.

If we add only a subset of tests to the runner (via --re or --deny_re options), we're careful to move the other tests to a "rejects" suite so they'll get deleted there, but the "supertest", the suite holding all the other tests, wasn't getting deleted in those cases. This (on top of my earlier fixes, and using the MOOSE suppressions file for third-party library issues) gets selective valgrind runs of our unit tests clean for me.

roystgnr · 2025-11-26T17:29:48Z

I think I've managed to square the circle of "we can't add a test to a runner without passing ownership of the pointer to the runner" and "when we get a test from a suite there's no way to take ownership of it away from the suite": we get each test from the suite, we wrap it in our own TestShim that passes along every API call except the destructor, and we hand the shim to the runner. The runner deletes the shims, then we can delete the suite to delete the non-shimmed tests, and nothing gets destroyed 2 times or 0 times.

I've added our Valgrind (unit tests) recipe, and I'm testing with valgrind myself; if nothing screams then this will finally be ready to merge.

roystgnr added the do not merge label Sep 10, 2025

jwpeterson approved these changes Sep 10, 2025

View reviewed changes

roystgnr added 2 commits November 24, 2025 11:43

TestShim to ensure every test is destructed once

0d0d8d1

roystgnr force-pushed the dont_leak_test_suite branch from f452222 to 0d0d8d1 Compare November 24, 2025 22:08

Give a name to our test driver "magic" returnval

76de8d1

roystgnr removed the do not merge label Nov 26, 2025

roystgnr merged commit e21a264 into libMesh:devel Nov 27, 2025
22 checks passed

roystgnr deleted the dont_leak_test_suite branch November 27, 2025 00:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Avoid memory leak in unit test driver #4249

Avoid memory leak in unit test driver #4249

roystgnr commented Sep 10, 2025

Uh oh!

roystgnr commented Sep 10, 2025

Uh oh!

jwpeterson left a comment

Uh oh!

jwpeterson Sep 10, 2025

Uh oh!

jwpeterson Sep 10, 2025

Uh oh!

roystgnr Sep 10, 2025

Uh oh!

moosebuild commented Sep 10, 2025 •

edited

Loading

Uh oh!

roystgnr commented Sep 10, 2025

Uh oh!

roystgnr commented Nov 26, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Avoid memory leak in unit test driver #4249

Avoid memory leak in unit test driver #4249

Conversation

roystgnr commented Sep 10, 2025

Uh oh!

roystgnr commented Sep 10, 2025

Uh oh!

jwpeterson left a comment

Choose a reason for hiding this comment

Uh oh!

jwpeterson Sep 10, 2025

Choose a reason for hiding this comment

Uh oh!

jwpeterson Sep 10, 2025

Choose a reason for hiding this comment

Uh oh!

roystgnr Sep 10, 2025

Choose a reason for hiding this comment

Uh oh!

moosebuild commented Sep 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Coverage

Uh oh!

roystgnr commented Sep 10, 2025

Uh oh!

roystgnr commented Nov 26, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

moosebuild commented Sep 10, 2025 •

edited

Loading